IDa-Det: An Information Discrepancy-Aware Distillation for 1-bit Detectors
171
where LGT is the detection loss derived from the ground truth label and LLim is the fine-
grained feature limitation defined in [235]. The LWS-Det process is outlined in Algorithm
13.
6.4.5
Ablation Study
Effectiveness of DBS. We first compare our DBS method with three other methods to
produce binarized weights–Random Search [277], Sign [99], and RSign [158]. As shown in
Table 6.4, we evaluate the effectiveness of DBS on two detectors: one-stage SSD and two-
stage Faster-RCNN. On the Faster-RCNN detector, the usage of DBS improves the mAP
by 8.1%, 4.3%, and 9.1% compared to Sign, RSign, and Random Search, respectively, under
the same student-teacher framework. On the SSD detector, DBS also enhances mAP by
5.5%, 3.3% and 11.3% compared to other binarization methods, respectively, which is very
significant for the object detection task.
Convergence analysis. We evaluate the convergence of detection loss during the training
process compared to other situations on two detectors: Faster-RCNN with ResNet-18 back-
bone and SSD with VGG-16 backbone. As plotted in Fig. 6.12, the LWS-Det training curve
based on random search oscillates vigorously, which is suspected to be triggered by a less
optimized angular error resulting from the randomly searched binary weights. Additionally,
our DBS achieves a minimum loss during training compared to Sign and RSign. This also
confirms that our DBS method can binarize the weights with minimum angular error, which
explains the best performance in Table 6.4.
6.5
IDa-Det: An Information Discrepancy-Aware Distillation for
1-bit Detectors
The recent art [264] employs fine-grained feature imitation (FGFI) [235] to enhance the
performance of 1-bit detectors. However, it neglects the intrinsic information discrepancy
between 1-bit detectors and real-valued detectors. As shown in Fig. 6.13, we demonstrate
that saliency maps of real-valued Faster-RCNN of the ResNet-101 backbone (often used as
the teacher network) and the ResNet-18 backbone, compared to 1-bit Faster-RCNN of the
ResNet-18 backbone (often used as the student network) from top to bottom. They show
TABLE 6.4
Ablation study: comparison of the performance of different
binarization methods with DBS.
Framework
Backbone
Binarization Method
mAP
Faster-RCNN
ResNet-18
Sign
65.1
RSign
68.9
Random Search
64.1
DBS
73.2
Real-valued
76.4
SSD
VGG-16
Sign
65.9
RSign
68.1
Random Search
60.1
DBS
71.4
Real-valued
74.3